News Selection with Topic Modeling
نویسنده
چکیده
There are numerous news articles coming to news aggregators and important news are selected to be presented on the front-page. There are two types of news selection for the front-page of news aggregators: personalized and public news recommendation (selection). This study examines public news recommendation that aims to satisfy all users’ interest on the front-page. Public news recommendation is mainly done by meta-features like news popularity. A different approach that exploits the news content is introduced in this work. The main target is to select important (significant) news articles while providing diversification in the selected news topics. A new approach based on topic modeling is developed for this purpose. Results show that it is hard to achieve satisfactory level of precision when content-based public news recommendation is applied. However, precision of topic modeling-based approach is noticeably better than precision of random news recommendation. Topics of selected news are also diversified by using topic modeling.
منابع مشابه
A New Document Embedding Method for News Classification
Abstract- Text classification is one of the main tasks of natural language processing (NLP). In this task, documents are classified into pre-defined categories. There is lots of news spreading on the web. A text classifier can categorize news automatically and this facilitates and accelerates access to the news. The first step in text classification is to represent documents in a suitable way t...
متن کاملA front-page news-selection algorithm based on topic modelling using raw text
Front-page news selection is the task of finding important news articles in news aggregators. In this study, we examine news selection for public front pages using raw text, without any meta-attributes such as click counts. A novel algorithm is introduced by jointly considering the importance and diversity of selected news articles and the length of front pages. We estimate the importance of ne...
متن کاملModeling the Impact of News on volatility The Case of Iran
In this paper various ARCH models and relevant news impact curves including a partially nonparametric (PNP) one are compared and estimated with daily Iran stock return data. Diagnostic tests imply the asymmetry of the volatility response to news. The EGARCH model, which passes all the tests and appears relatively matching with the asymmetry in the data, seems to be the most adequate characteriz...
متن کاملA Cross-Cultural Pragmatic Study of Indirect Complaint Responses in Iranian and American News Interviews: Iran’s Nuclear Negotiations
The present study intended to compare the complaint responses used by President Rouhani and President Obama in the Iranian and US news interview contexts. For this purpose, Boxer’s (1993) six types of indirect complaint responses were adopted: ‘ignorance’, ‘questions’, ‘topic switch’, ‘contradiction’, ‘joke/teasing’, ‘advice/lecture’ and ‘agreement/commiseration’. The transcripts of the live ne...
متن کاملTopic-Focused Summarization of News Events Based on Biased Snippet Extraction and Selection
In this paper, we propose a framework to produce topic-focused summarization of news events, based on biased snippet extraction and selection. Through our approach, a summarization only retaining information related to a predefined topic (e.g. economy or politics) can be generated for a given news event to satisfy users with specific interests. To better balance coherence and coverage of the su...
متن کامل